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Abstract 

In this paper a new lifetime distribution which is obtained by compounding Lindley and geomet- 
ric distributions, named Lindley-geometric (LG) distribution, is introduced. Several properties 
of the new distribution such as density, failure rate, mean lifetime, moments, and order statistics 
are derived. Furthermore, estimation by maximum likelihood and inference for large sample are 
discussed. The paper is motivated by two applications to real data sets and we hope that this 
model be able to attract wider applicability in survival and reliability. 

Keywords: Bathtub failure rate, EM algorithm. Geometric distribution, Lindley distribution, 
Maximum likelihood estimation, Unimodal failure rate. 
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1. Introduction 

The Lindley distribution specified by the probability density function (p.d.f.) 

^(^) = ^(1 + ^)""'"' ^>0' ^>0' 



was introduced by Lindley [13(]. The corresponding cumulative distribution function (c.d.f.) is 
given by 

F{x) = l-{l + ^)e-'-, x>0, 0>O. (2) 

The Lindley distribution, in spite of little attention in the statistical literature, is important 
for studying stress-strength reliability modeling. Besides, some researchers have proposed new 
classes of distributions based on modifications of the Lindley distribution, including also their 
properties. Sankaran j^] introduced the discrete Poisson-Lindley distribution by combining 
the Poisson and Lindley distributions. Ghitany et al. [oj] investigated most of the statistical 



Email address: emahmoudiOyazduni . ac . ir (Eisa Mahmoudi) 



Preprint submitted to Computational Statistics and Data Analysis 



March 2, 2013 



properties of the Lindley distribution, showing this distribution may provide a better fitting than 
the exponential distribution. Mahmoudi and Zakerzadeh [l^ proposed an extended version of 
the compound Poisson distribution which was obtained by compounding the Poisson distribution 
with t he g eneraUzed Lindley distribution which is obtained and analyzed by Zakerzadeh and 
Dolati 251]. Recently a new extension of the Lindley distribution, called extended Lindley (EL) 



distribution, which offers a more flexible model for lifetime data is introduced by Bakouch et al. 



Adamidis and Loukas introduced a two-parameter lifetime distribution with decreasing 
failure rate by compounding exponential and geometric distributions, which was named expo- 



nential geometric (EG) distribution. In the same way, Kus 



121 and Tahmasbi and Rezaei 



24| 



introduced the exponential Poisson (EP) and exponential logarithmic distributions, respectively. 
Marshall and Olkin [3] presented a method for adding a parameter to a family of distributions 
with application to the exponential and Weibull families. 

Recently, Chahkandi and Ganjali sl introduced a class of distributions, named exponential 
power series (EPS) distributions, by compounding exponential and power series distributions, 
where compounding procedure follows the same way that was previously carried out by Adamidis 
and Loukas [l]; this class contains the distributions mentioned before. Extensions of the EG 
distribution was given by Adamidis et al. and Barreto-Souza et al. 0|, where the last was 
obtained by compounding Weibull and geometric distributions. A three-parameter extension of 
the EP distribution was obtained by Barreto-Souza and Cribari-Neto 5|]. 

This new class of distributions has been received considerable attention over the two last 
years. Weibull power series (WPS), complementary exponential geometric (CEG), two-parameter 
Poisson-exponential, generalized exponential power series (GEPS), exponentiated Weibull- Poisson 
(EWP) and generalized inverse Weibull-Poisson (GIWP) distributions were introduced and stud- 

n , ,1 

led by Morals and Barreto-Souza [201], Louzada-Neto et al. yL4i]) Cancho et al. Mahmoudi 



and Jafari [16}], Mahmoudi and Sepahdar 17] and Mahmoudi and Torki [18| . 

In this paper, we introduce a new lifetime distribution by compounding Lindley and geo- 
metric distributions as follows: Consider the random variable X having the Lindley distribution 
where its pdf and cdf are given in ([1]) and 

Given N, let Xi,--- ,Xn be independent and identically distributed (iid) random variables 
from Lindley distribution. Let the random variable A'^ is distributed according to the geometric 



2 



distribution with pdf 

P(N = n) = (1 - n = 1, 2, • • • , < p < 1. 

Let Y = min(Xi, • • • , X^r), then the conditional cdf of Y\N = n is given by 

FY\N{y\n) = 1 - 

The Lindley-geometric (LG) distribution, denoted by LG(p, 6*), is defined by the marginal cdf 
of Y, i.e., 

1 _ (1 + Iy-)f,-Sy 

Fyiy) = , \,'t\ ^>0' ^<p<^- (4) 

l-p(l + 0:^)e 

The paper is organized as follows. In Section 2, the density function, survival and hazard 
rate functions of the LG with some of their properties are given. Section 3 provides a general 
expansion for the quantiles and moments of the LG distribution. Its moment generating func- 
tion is derived in this section. Section 4 provides the moments of order statistics of the LG 
distribution. Residual life and reversed residual functions of the LG distribution is discussed in 
Section 5. Section 6 is devoted to the Bonferroni and Lorenz curves of the LG distribution. In 
Section 7 we explain the probability weighted moments. Mean deviations from the mean and 
median are derived in Section 8. Estimation of the parameters by maximum likelihood via an 
EM-algorithm and inference for large sample are presented in Section 9. Applications to two 
real data sets are given in Section 10 and conclusions are provided in Section 11. 



(1 + 



-i + l- 



-ey 



(3) 



2. Density function, survival and hazard rate functions 

The probability density function of the LG distribution is given by 

/(2/) = ^(l-p)(l + yK'^[l-p(l + ^)e-'^]~', y>0, (5) 
where > and < p < 1. 

Even when p < 0, Equation ([5]) is a density function. We can then define the LG distribution 
by Equation ^ for any p < 1. Some special sub-models of the LG distribution ^ are obtained 
as follows. If p = 0, we have the Lindley distribution. When p — )• 1~, the LG distribution 
tends to a distribution degenerate in zero. Hence, the parameter p can be interpreted as a 
concentration parameter. LG density functions are displayed in Figure 1 for selected values of 
e and J3 = -2, - .05, 0, .05, .09. 



Theorem 1. The density function of the LG distribution is (i) decreasing for all values p and 
9 for which p > j-^, (H) unimodal for all values p and 9 for which p < jj^- 

Proof. See Appendix. 

The survival function and hazard rate function of the LG distribution, are given respectively 

by 

{1 - p){i + ^)e-^y 

S{y) = ^ |±ii , (6) 



and 
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Figure 2 provides the plots of the hazard rate function of the LG distribution for different values 
p = —2, — .05, 0, .05, .09 and 9 = .03, .08, 1, 3. We have the following results regarding 
the shapes of the hazard rate function of the LG distribution. The proof is provided in the 
Appendix. 

Theorem 2. The hazard function of the LG distribution in ^ is (i) bathtub- shaped ifp > j^^, 
(ii) firstly increasing then bathtub- shaped if p < j^:^- 

Proposition 1. The hazard rate function of the LG distribution in ^ tends to (g_(_i)(i_p) and 
9 where y ^ and y — > oo, respectively. 

Using the series expansion 
where |z| < 1 and /c > 0, the density function ([5]) can be demonstrated by 

/LG(y;p, 0) = ^(1 - + y)e''^y Y.^j + 1^(1 + j^y^''''^- (9) 

Various mathematical properties of the LG distribution can be obtained from ([9]) and the cor- 
responding properties of the Lindely distribution. 

In the following theorem, we give the stochastically ordering property of the random variable Y 
with LG distribution. 

Theorem 3. Gonsider the two random variables Yi and Y2 with LG{pi,9) and LG{p2,9) dis- 
tributions, respectively. 

(i) If Pi < P2, then SY,{t) < Sy-M C^i < Y2) and hy.it) < hy.it) (Yi < Y2). 

f (t) 

(ii) If pi > P2, then is decreasing in t, i.e., (Yi < Y2). 
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3. Quantiles and moments of the LG distribution 

Applying the equation F{x^) = ^, the ^th quantile of the LG distribution is the solution of 
equation 



1-pS, 9 + 1' 

which is used for data generation from the LG distribution. 

Suppose that Y ~ LG{p,6), using the equation ([9]) and applying the binomial expression for 
(1 + , the rth moment of Y is given by 

f(l-p)f.^/A 9 , T{r + i + l) r + i + l 

j — i — 

Using Eq. ([TO]) . the moment generating function of the LG distribution is given by 



_ef(i-p)v^oo v^oo I i\„?7 g Y r(fc+i+i) fc+i+i N 

— e+i z^fc=o z^i=o z^i=o fe! I J V "T J-J/^ ve+i^ (6»{j+i))fe+'+i v-^ eo'+i)''- 

Proposition 2. The mean of the LG distribution ig given by 
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4. Order statistics and their moments 

Order statistics make their appearance in many areas of statistical theory and practice. Order 
statistics are among the most fundamental tools in non-parametric statistics and inference and 
play an important role in quality control testing and reliability, where a practitioner needs to 
predict the failure of future items based on the times of a few early failures. 
Let Yi, - ■ ■ ,Yn be a random sample taken from the LG distribution and Yim, ■ ■ ■ ,Yn:n denote 
the corresponding order statistics. Then, the pdf fr:n{y) of the rth order statistics Yr.n is given 
by 

fr-niy) = Be(..nUl) - P)i^ + ^)^'^^ ^7=0 Ego(-l)^ (1) ^^^^^0' + 1^ 

(\ 71 — 7'-\-i-\- i 
l + ^j e'in-r+i+j+l)ey ^ 
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where Be{a,b) = JqW'^ ^(1 ~ '^T ^^"^ the beta function. After some calculations and using 



the binomial expression for ( 1 + 



n-r+i+j+l 



, we have 



fr-n{y) = Be(..nUl) &(l - P)i^ + V)^''' ^T=0 ELo E£o E™=0^^^^'^Vl)^ r^V.^'^') 

The kth moment of the rth order statistic Yr;n can be obtained from the known result, 

-^[^r*^n] = Be{r,n-r+l)'ir[(^ ~ P) ^i^O Z]i=0 Hl=0 



^oo s^n—r+i+j+l I 



x(j + iy+'(i-pr--+^u^ 



r(fc+m+2) 



(6»(n-r+i+i+Z+l))'=+'"+2 + (6l(n-r+i+j+Z+l))'=+'"+ 



m I \i) 
r(fc+m+l) 



5. Residual life and reversed failure rate function of the LG distribution 

Given that a component survives up to time t > 0, the residual life is the period beyond t 
until the time of failure and defined by the conditional random variable y — t|y > t. In reliability, 
it is well known that the mean residual life function and ratio of two consecutive moments of 
residual life determine the distribution uniquely (Gupta and Gupta, [3]). Therefore, we obtain 
the rth-order moment of the residual life via the general formula 

1 



Hrit) = E\{Y -tY\Y >t\ 



{y-tYf{y)dy, 



S{t) jt 

where S{t) = 1 — F{t), is the survival function. 

Applying the binomial expansion to (y — ty into the above formula gives 

k r 



^^r{t) 



(e+i)S{t) 



r + k-i+2 



x{T{r + k-i + 2; et{j + 1)) + e{j + l)r(r + k-i + l- et{j + 1)) 



where r{s;t) = ^dx, shows the upper incomplete gamma function. 

The mean residual life (MRL) of the LG distribution is given by 



{e+i)S{t) ^0 



fc+3 



(Vik + 3; dt{j + 1)) + e{j + l)r{k + 2; et{j + 1)) 



t. 



In particular, we obtain 



i + l 



+ 



i + 2 



r > 1, 



(12) 



(13) 



(14) 



Also, if p = 0, then 



flit) 



2 + e + et 



e{i + e + ety 

which is the MRL function of the original Lindley distribution. The variance of the residual life 
of the LG distribution can be obtained easily by using /i2(i) and 

The reversed residual life can be defined as the conditional random variable t—Y\Y < t which 
denotes the time elapsed from the failure of a component given that its life is less than or equal 
to t. This random variable may also be called the inactivity time (or time since failure); for more 
details one can see (Kundu and Nanda, [11]; Nanda et al., ^^\)- Also, in reliability, the mean 
reversed residual life and ratio of two consecutive moments of reversed residual life characterize 
the distribution uniquely. Using and ([5]), the reversed failure (or reversed hazard) rate 
function is given by 



r(y) 



f{y) 



e+i 



(l-p)(l + y)e 



-ey 



y >o. 



(15) 



1-P(1 + 5TT. 

It is noticed that /i(0) = oo and /i(0) is discontinuous in the parameters of the LG distribution. 
The rth-order moment of the reversed residual life can be obtained by the well known formula 

1 



mr{t) = E{{t-YY\Y <t\ 



m Jo 



{t-yYf{y)dy, 



hence, 
mr{t) 



{e+i)F{t) 



k r 



(0(j + l))'-+'=-»+2 



X 7(r + /c - i + 2; Btij + 1)) + e{j + l)-f{r + k - i + 1; et{j + 1)) 



r > 1, 



(16) 

where 7(5; t) = Jq x^~^e~^dx, shows the lower incomplete gamma function. Thus, the mean of 
the reversed residual life of the LG distribution is given by 



m{t) 



(i-p)e^ 
{e+i)s{t) 1^0 



Er=oEi=oa)o-+iy 



(0(j + l))fe+3 



X 7(A: + 3; Ot^j + 1)) + 0(i + 1)7(A; + 2; Ot^j + 1)) 



(17) 



Using m(t) and m2{t) one can obtain the variance and the coefficient of variation of the reversed 
residual life of the LG distribution. 
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6. Bonferroni and Lorenz curves of the LG distribution 

The Bonferroni and Lorenz curves and Gini index have many apphcations not only in eco- 
nomics to study income and poverty, but also in other fields like reliability, medicine and insur- 
ance. The Bonferroni curve BF[F{y)] is given by 

Using this fact that 

i{y) = lo' ufiu)du = ^ Er=o Ei=o a) i3 + (^) ' 



X ( 7(fe + 3; dy{j + 1)) + 6{j + l)^{k + 2; 6y{j + 1)) ) , (18) 



the Bonferroni curve of the distribution function F of LG distribution is given by 



(e(i+i))fe+3 

X ( 7(A: + 3; + 1)) + e{j + l)^{k + 2; ey{j + 1)) ) , (19) 



where ^ (the mean of LG distribution) is given in (jlip . 

Also, the Lorenz curve of F that follows the LG distribution can be obtained via the expression 



LF[F{y)] = BF[F{y)]F{y). The scaled tota. 
of a distribution function F (Pundir et al., 



time and cumulative total time on test transform 



22] are defined by 



1 /■* 

SF[F(t)] = - / S(u)du, 
Jo 



1^ Jo 

and 



Cf= [' SF[F{t)]f{t)dt, 
Jo 



respectively, where S{.) denotes the survival function. If F{t) denotes the LG distribution 
function specified by dH then we have. 

The Gini index can be obtained from the relationship G = 1 — Cf ■ 
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7. Probability weighted moments 

The probabihty weighted moments (PWMs) method can generahy be used for estimating 
parameters of a distribution whose inverse form cannot be expressed exphcitly. We calculate 
the PWMs of the LG distribution since they can be used to obtain the moments of the LG 
distribution. The PWMs of a random variable Y are formally defined by 

oo 

Ts,r = E[Y'F{YY] = I y'F{yYf{y)dy, (20) 



where r and s are positive integers and F{.) and /(.) are the cdf and pdf of the random variable 
Y. The PWMs of the LG distribution are given in the following proposition. 

Proposition 3. The PWMs of the LG distribution with cdf dH) and pdf ([5]), are given by 

_ 0^+2(1^ ^ v^r ^k+j+l(_.^lfr■\fk+j+l\(r+k-l\(■,.^i+k 

^s^r — (5i_,_i)m+i l^j=0 l^k=0 2^1=0 2^m=0 V ^) m )\ k jU^^jP^ 

(21) 



r(m+s+l) _ r(m+s+2) 



Proposition 4. The sth moment of the LG distribution can be obtained putting r = in Eq. 
(I2ip . Also, the mean and variance of the LG distribution can be obtained. 

8. Mean deviations 

The amount of scatter in a population can be measured by the totality of deviations from 
the mean and median. For a random variable X with pdf /(.), cdf F{.), mean fi = E[X) 
and M = Median{X), the mean deviation about the mean and the mean deviation about the 
median, are defined respectively by 

fOO 



poo 

5iiX)= / \x- is\f{x)dx = 2fiF{fi)-2I{is), 
Jo 



and 

62{X) 



l-OO 

/ \x-M\f{x)dx = n-2I{M), 
Jo 



where I{b) = Jq xf{x)dx. 

For the LG distribution we have 



m = Er=o n=o a) u + (4t 



k r 



(0(j + l))fc+3 



x(jik + 3;9bij + l)) + 0ij + l)j{k + 2;ebij + l))) . (22) 
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Theorem 4. The Mean deviations of the LG distribution are given by 

1 - (1 + ^)e"^'' 
\-p{l + ^,)e'^^ 

and 

62 = fi-2I{M), 

respectively, where ^ is the mean of LG in Eq. I{fJ-) and I{M) are obtained by substituting 

and M in Eq. (Eg). 



9. Estimation and inference 

The estimation of the parameters of the LG distribution using the maximum hkehhood 
estimation is studied in this section. Let Yi,Y2,--- ,Yn be a random sample with observed 
values ?/i,?/2r'' jUn from LG distribution with parameters p and 9. The total log-likelihood 
function is given by 

Uy, p, e) = 2n \og{e) - n log(l + 6) + n log(l - p) + Zti log(l + " ^ Eti V^ 

-2Er=iiog(i-Ki+fe)e-'^" 

The associated score function is given by Un = {dln/dp,dln/d6)'^ , where 

dp l-p ^ 1^1=1 i_p(i+^)e-ej/i ' 



din _ 2n _ n _ sr^n _ n 





/I \ ^Vi \ 1 
'■^^9+1'' (9+1)^ 






2 



i=l 

The maximum likelihood estimation (MLE) of p and 6 is obtained by solving the nonlinear 
system C/„ = 0. The solution of this nonlinear system of equation has not a closed form. For 
interval estimation and hypothesis tests on the model parameters, we require the information 
matrix. 

9.1. Asymptotic variances and covariances of the MLEs 

Applying the usual large sample approximation, MLE of G i.e. = {p,0), can be treated 
as being approximately bivariate normal with mean Q and variance-covariance matrix, which 
is the inverse of the expected information matrix J(©) = E[I{Q)], i.e., N2{Q, J{@)~^), where 
I{Q;yobs) is the observed information matrix with elements lij = —d'^l/dOiOj with i, j = 1,2 
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and the expectation is to be taken with respect to the distribution of Y. Differentiating dl/dp 
and dl/dO, the elements of the symmetric, second-order observed information matrix are found 
to be 



1 2 



J n r, 

-'11 -(W)7-^Z^i=i 



[l-p(H-|fi)e-*^.] 



'12 



2V.n .,e-^^.[(l+||^)-^] [,,e-^..((l+|^)--J-^)][(l+|^)e-^..] 
^'=' [1-P(l+I^)e--]^ ^'=^ [l-p(l+|^)e-^.]^ 



22 - - (T^ + 2P E"=l 



(1+6))-^ (i+fl)^^i+e^ _ ''i^ y-" (1+9)2 



2 



(l-fe-''^^(l+^))^ (l-pe-«-.(l+^)) 
(l-pe-««i(l+^))3 

The elements of the expected information matrix, J(0), are calculated by taking the expectations 
of lij, i, j = 1, 2, with respect to the distribution of Y. When the expectations of lij, i, j = 1,2 
is obtained, we would have the matrix J(B), the inverse of J(B), cvahiated at provides the 
asymptotic variance-covariance matrix of MLEs. Alternative estimates can be obtained from 
the inverse of the observed information matrix since it is a consistent estimator of J~^(0). 
The estimated asymptotic multivariate normal iV2(0,/(G) ^) distribution of can be used 
to construct approximate confidence intervals for the parameters and for the hazard rate and 
survival functions. An 100(1 — 7) asymptotic confidence interval for parameters p and 9 is given 

by 



ACIp = {p - Zx\J Ipp,p + Zx\J Ipp) , 

and 



ACIg = {§ - e + Zx sfi^e), 



where ipp and /gg are the diagonal element of I{Q) and Zx is the quantile 1 — 7/2 of the 
standard normal distribution. 

We can compute the maximized unrestricted and restricted log-likelihoods to construct likeli- 
hood ratio (LR) statistics for testing some LG sub-models. For example, we can use LR statistics 
to check whether the fitted LG distribution for a given data set is statistically "superior" to the 
fitted Lindely distribution. In any case, hypothesis tests of the type Hq : B = Bq versus 
Hi : B = Bo can be performed using LR statistics. In this case, the LR statistic for testing 
Hq versus Hi is w = 2{/(B) — i(Bo)}, where B and Bq are the MLEs under Hi and Hq. The 
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statistic w is asymptotically (as n — >■ oo) distributed as Xk^ where k is the dimension of the 
subset of interest. 



9.2. An EM algorithm 

Let the complete-data be Yi, • • • ,Yn with observed values yi, ■ • ■ ,yn and the hypothetical 
random variable Zi, - ■ ■ ,Zn. The joint probability density function is such that the marginal 
density of Yi, • • • , 1^ is the likelihood of interest. Then, we define a hypothetical complete-data 
distribution for each (Y^, Zj) i = 1, ■ ■ ■ ,n with a joint probability density function in the form 



g{y, z; 6) = (1 - + y)e-'y [p{l + ^)e-'^^] (23) 



where Q = {p,9), y > and z G N. 

Under the formulation, the E-step of an EM cycle requires the expectation of {Z\Y; Q^^^) where 
Qi^) = (a^^\ p(^\j(^\6('^^) is the current estimate (in the rth iteration) of 0. 
The pdf of Z given Y, say g{z\y) is given by 

g{z\y) = z[p{l + J^)e-^yY-\l - pil + ^)e-'')' ■ 

Thus, its expected value is given by 

, , , (i+p{i + ^)e-'y) 

E[Z\Y = y] = — ^-P^ -. 

The EM cycle is completed with the M-step by using the maximum likelihood estimation 
over 0, with the missing Z's replaced by their conditional expectations given above. 
The log-likelihood for the complete-data is 



/;(y, z; G) cx JZti log + 2n log(^) - n log(l + 6) + n log(l - p) - ELi Vi 

+ U=i iog(i + y^) + Er=i(^^ - 1) log + • 

The components of the score function U*{&) = i^,^)^ are given by 

din _ y^"- Zj-l _ n 
dp Z_/i=l p 1— p' 

^ _2n__«__Y-n „, , V" (~ ^\ ^^^^ ' 

ae ~ e e+i i^i=i ^ 2^i=i\^i ^) (i i e^\)c-^"i ' 
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From a nonlinear system of equations U*{Q) = 0, we obtain the iterative procedure of the EM 
algorithm as 

0(r+l) 



1^ * ' fi+|l:±^)e-9'^+^)«» 



g(r + l) 5)(r + l)_)_]^ Z-^l = l ' 



where ^(''+^) is found numerically. Hence, for i = 1, - ■ ■ ,n, we have that 

(r) _ l^^/^ '-'^^gW+l^^ ^ 



10. Applications to real data sets 

In this Section we fit LG distribution to two real data sets and compare the fitness with the 



extended Lindely (Bakoucha et al., 
densities are given by 



), Lindely, Weibull and exponential distributions, whose 



fELix;a,9,j) = "'^\lllfr' [7(l + ^ + ^x)(gxp-i-a]e-(^")", x,9,j>0, aeR-\J{0,l}, 
fL{x;e) = ^{l + x)e-'^, x,e>0, 

fE{x-e) = ee-'^^, x,e>o, 

respectively. The first data set represents the waiting times (in minutes) before service of 100 



bank customers. This data is examined and analyzed by Ghitany et al. js] in fitting the Lindely 
distribution. 

In the second data set, we consider vinyl chloride data obtained from clean upgradient 
monitoring wells in mg/L; this data set is used by Bhaumik et al. in fitting the gamma 
distribution for small samples. 

In order to compare distributions, we consider the K-S (Kolmogorov-Smirnov) statistic with 
its respective p-value, — 21og(L), AIC (Akaike Information Criterion), AICC (Akaike Informa- 
tion Criterion Corrected), BIC (Bayesian Information Criterion), AD (Anderson-Darling) and 
CM (Cramer-von Mises) statistics for the two real data sets. 

The best distribution corresponds to lower —2 log(L), AIC, AICC, BIC, AD and CM statistics 
values. Table 1 shows parameter MLEs with the standard errors according to each one of the 
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Table 1: MLEs (STDs) of the fitted distributions corresponds to data 1 and 2 





iVloclcl 


a 


7 


2 

u 


P 




LG 






2027(0 0346') 


-0 2427f0 52701 




EL 


-le-08(4 3212) 


1 4585(0 ICQS") 


9128f0 0066) 




Data 1 


T rinnplv 












Weibull 




1 4:585(0 inQS") 


0305 fO 0096") 






J— /jt^ 






n 1 01 2^0 01 01 "1 






LG 






0.5458(0.2305) 


0.6346(0.3079) 




EL 


-1.4435(3.9990) 


1.1380(0.4395) 


0.2937(0.4138) 




Data 2 


Lindley 






0.8238(0.1054) 






Weibull 




1.0102(0.1327) 


0.5202(0.1177) 






Exp 






0.5321(0.0913) 





five fitted distributions for the two real data sets. Also, Table 2 shows the values of — 21og(L), 
K-S statistic with its respective p-value, AIC, AICC, BIC, AD and CM statistics values. 

The values in Table 2, indicate that the LG distribution is a strong competitor to other 
distributions commonly used in literature for fitting lifetime data. These conclusions are cor- 
roborated by the fitted pdf, cdf and survival functions of the LG, EL, Lindley, Weibull and 
exponential distributions in Fig. 3. We observed a difference between the fitted curves, which is 
a strong motivation for choosing the most suitable distribution for fitting these two sets of data. 
Prom the above results, it is evident that the LG distribution is the best distribution for fitting 
these data sets compared to other distributions considered here. 
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Table 2: K-S, p-values, -21og(L), AIC, AICC, BIG, AD and CM corresponds to data 1 and 2 
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EL 


0.0813 


0.9780 


110.6 


116.6 


117.4 


121.2 


0.2402 


0.1159 


Data 2 


Lindely 


0.1326 


0.5880 


112.6 


114.6 


114.7 


116.1 


0.6873 


0.1993 




Weibull 


0.0919 


0.9364 


110.9 


114.9 


115.3 


118 


0.2826 


0.1242 




Exp 


0.0889 


0.9508 


110.9 


112.9 


113 


114.4 


0.2719 


0.1214 



11. Conclusion 

We propose a new two-parameter distribution, referred to as the LG distribution wliich 
contains as special the Lindley distribution. The hazard function of the LG distribution can be 
decreasing, increasing and bathtub-shaped. Several properties of the LG distribution such as 
moments, maximum likelihood estimation procedure via an EM-algorithm, moments of order 
statistics, residual life function and probability weighted moments are studied. Finally, we fitted 
LG model to two real data sets to show the potential of the new proposed distribution. 
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Appendix 

Proof of Theorem 1: 

The behavior of f{y) is completely similar to the behavior of log(/(y)). For simplicity we 
consider the behavior of log(/(y)). The derivation of log(/(y)) with respect to y is given by 

^-[log(/(y))] - "''^^ + 0){l-e-ye)- p{l + (1 + yfe^) 



dy' ^^'^^"^ {i + y){^e<)y{i + 0)-p{i + e + ey)) 

The dominator of ^[log(/(y))] is positive for each value y, p and 9, therefore we only consider 
its nominator. Suppose that 

g^[y) = e'y{i + e){i-e-ye), 

and 

52(y)=p(i + (i + yW. 

Note that 5i(0) = 1 — 9'^ and (72(0) = p(l + 9'^). (72 (y) is an increasing function of y for 
p > 0, ^ > and a decreasing function for all values p < 0, ^ > 0. Also, gi{y) is a decreasing 
function of y for all values p and 6. Consider the following comparisons between gi{y) and g2{y)- 



(i) For all values of p and 9 for whichp > jq;^, then gi{y)—g2{y) > and hence a^[log(/(y))] < 

which implies the decreasing behavior of f{y). 

(ii) For all values of p and 6 for which p < j^^, then there exist a point y* > such that (a) 

for each y < y*; yi(y) — y2(y) < and hence ^[log(/(y))] > which implies the increasing 
behavior of /(y); (b) for each y > y*; gi{y) — g2{y) < and hence ^[log(/(y))] < which 
implies the decreasing behavior of /(y). Thus in this case the pdf /(y) has a unique mode 
at point y* and also is unimodal. 

Proof of Theorem 2: 

The hazard rate function of the LG distribution in ([7]) is given by 

. 9H9 + l){l + y) 

{1 + 9 + 9y)[9 + 1 - p{l + 9 + 9y)e-^yy 

The derivation of log(/i(y)) with respect to y is given by 

d_ _ e'yjl + 9)-p{l + 9 + 9y){l + (1 + yf9^) 

dy ^ ^^''^ (1 + y)(l + + 9y) [e^J/(l + 9) - pil + 9 + 9y)] ' 
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The dominator of ■^[log{h{y))] is positive for each values y, p and 0, therefore we only consider 
its nominator. Suppose that 

hiiy) = e('y{i + e), 

and 

h2{y)=p{l + e + ey){l + il + yfe^). 

hi (y) is an increasing function of y where hi{0) = 1 + 9 > and hi (y) > for each y > and 
^ > 0. 

/t2(y) is a polynomial function of y which is an increasing function of y for < p < 1 and 
decreasing for p < and /i2(0) = p(l + 0){1 + 9^). Note that 

h'^iy) = pe{3{e{l + y)f + 20(1 + y) + 1), 

and 

h'i{y) = 2pe\m'^ + y)) + ^)- 

Comparison of functions hi{y) and h2{y) implies the following results: 

(i) li 1 + 9 < p{l + 9){\ + 0"^) or p > jq^, then there exist a unique point y* > such that 

^i(y) — ^2(y) < for < y < y* and ^i(y) — h2{y) > for y > y*. In this case h{y) is 
first decreasing for < y < y* and then increasing function of y for y* < y, therefore h{y) 
is bathtub-shaped. 

(ii) If p < i^, then there exist two points yl > and y| > such that hi{y) — ^2(y) > 

for < y < yl, hi{y) - h2{y) < for y^ < y < y^ and hi{y) - h2{y) > for y > y|. In 
this case h{y) is increasing for < y < y*, decreasing for y\ <y <y2 and then increasing 
function of y for y > y^, therefore h{y) is firstly increasing and then bathtub-shaped, in 
this case. 
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